Quantum state fusion in photons 
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Photons are the ideal carriers of quantum information for communication. Each photon can have 
a single qubit or even multiple qubits encoded in its internal quantum state, as defined by optical 
degrees of freedom such as polarization, wavelength, transverse modes, etc. Here, we propose 
and experimentally demonstrate a physical process, named "quantum state fusion", in which the 
two-dimensional quantum states (qubits) of two input photons are combined into a single output 
photon, within a four-dimensional quantum space. The inverse process is also proposed, in which 
the four-dimensional quantum state of a single photon is split into two photons, each carrying a 
qubit. Both processes can be iterated, and hence may be used to bridge multi-particle protocols of 
quantum information with the multi-degree-of-freedom ones, with possible applications in quantum 
communication networks. 



The emerging field of quantum information technol- 
ogy is based on our ability to manipulate and transmit 
the internal quantum states of physical systems, such 
as photons, ions, atoms, superconducting circuits, etc. 
[U 12] • In photonic approaches [3J 3] , much research ef- 
fort has been recently devoted to expanding the useful 
quantum space by two alternative approaches: either by 
increasing the number of involved photons [5] or by ex- 
ploiting different degrees of freedom of the same photon, 
such as polarization, time-bin, wavelength, propagation 
paths, and orbital angular momentum [6lJ16|. Hitherto, 
however, these two approaches have proceeded parallel to 
each other, with little or no attempt at combining them. 
In this paper, we introduce and prove the realizability of 
two novel quantum-state manipulation processes, namely 
fusion and fission, that can be used to bridge these two 
approaches, allowing for their full integration. Some pos- 
sible specific applications will be discussed in the conclud- 
ing paragraphs of this paper. 

Quantum state fusion is here defined as the physi- 
cal process by which the internal quantum state of two 
particles (e.g., photons) is transferred into the four- 
dimensional internal quantum state of a single particle 
(photon) . In the language of quantum information, two 
arbitrary input qubits initially attached to separate par- 
ticles are transferred into a single particle, where they 
are encoded exploiting at least four independent states 
of the same particle (i.e., a single-particle "qudit"). In 
this framework, this process may also be called "quan- 
tum information fusion" or "qubit fusion" . For example, 
let us assume that initially we have two photons, labeled 
1 and 2, having independent polarization quantum states 
(i.e., carrying polarization-encoded qubits), as follows: 

\i>) 1 =a\H) l + p\V) 1 

\4>)2=l\H) 2 + 5\V) 2 (1) 

where H and V denote the horizontal and vertical linear 
polarizations, respectively. The initial two-photon state 



can then be written as the product 

|*)i2 = (<*\H)i + P\V)i) ® (l\H) 2 + S\V) 2 ) 

= a 1 \H) l \H) 2 + ad\H) 1 \V) 2 + /3 1 \V) 1 \H) 2 
+P5\V) 1 \V) 2 

The quantum state fusion corresponds to transforming 
this state into the same linear combination of four or- 
thogonal single particle states of the outgoing photon 3: 

|*)ia -+ l*>3 = "7|0)3 + a*|l) 3 + 07|2> 3 + /M|3>3 (2) 

where |0), |1), |2), |3) denote four orthogonal internal 
states of the photon. These four states can be used to de- 
fine a four-dimensional logical basis of a "qudit" carried 
by the photon. Of course we cannot use the sole two- 
dimensional polarization space for the outgoing photon. 
One possibility is to use four independent spatial modes 
or, alternatively, two spatial modes combined with the 
two polarizations. Figure [T^, graphically illustrates the 
quantum fusion concept. 

More generally, the fusion process should work even 
for entangled quantum states, both internally entangled 
(i.e., the two particles are entangled with each other) and 
externally entangled (the two particles are entangled with 
other particles). In the first case, the four coefficients ob- 
tained in the tensor product cry, aS, /3j, j36 are replaced 
with four arbitrary coefficients a Q , a±, a 2 , a^. In the sec- 
ond case, the four coefficients are replaced with four kets 
representing different quantum states of the external en- 
tangled system. 

It should be noted that in terms of information content, 
the quantum fusion has no effect: we have two qubits ini- 
tially and the same two qubits at the end. In other words, 
state fusion is not a quantum logical gate for information 
processing (and it should not be confused with the fusion 
gate used to create quantum clusters, see e.g. [H])- How- 
ever, physically there is an important transformation, as 
the two qubits are moved from two separate particles into 
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FIG. 1. Quantum state fusion. (a) Concept: input 
photons 1 and 2, each carrying a single qubit in their two- 
dimensional internal states (shown as red and green circles), 
are merged into output photon 3, carrying both qubits in its 
four-dimensional internal space, (b) Implementation scheme 
based on CNOT gates. \tp) and \<j>) denote the qubit states 
of the two input photons. \ip) ® \<j>) denotes the 2-qubit state 
of the outgoing photon, conditional on detection of one out- 
put photon in the logical-zero output port of the Hadamard 
gate (H). c and t denote the control and target ports of the 
CNOT gates, (c) Inverse scheme for quantum state fission. 
1 10} denotes the state of an auxiliary input photon that is lo- 
calized in the upper mode, corresponding to logical-zero. The 
process success is conditional on detecting no photon in the 
logical-one output of the two H gates. 



a single one. This is analogous to what occurs in quan- 
tum teleportation, in which the information content is 
unchanged, but there is a physical transformation in the 
information localization [TM2T)] . 

To be useful in the quantum information field, the pro- 
cess of qubit fusion ideally should be also "iterable" , i.e. 
it should be possible to keep adding qubits to the same 
particle by using a larger and larger internal space. More- 
over, the process should be "invertible" , i.e. one should 
be able to split a higher-dimensional quantum state of a 
particle into two (or more) particles. We will name this 
inverse process "quantum state fission". 

The feasibility of quantum state fusion and fission in 
photons is a question of clear fundamental interest, be- 
cause photons do not interact, or interact very weakly in 
nonlinear media. Therefore, there is no straightforward 
way to transfer the quantum state from one photon to the 
other. The solution we adopt here is based on the Knill- 
Laflamme-Milburn (KLM) approach to quantum compu- 
tation with linear optics (TTl [21] . A qubit transfer from 
a particle carrier to another can be generally realized 
by the application of a single controlled-NOT (CNOT) 
logical gate. The source qubit in state a|0) c + /3|l) c 



is used as control (hence the label c) and the destina- 
tion qubit as target, which is initialized to the logical 
zero |0) t . After the CNOT, the two qubits are entangled 
in state cv|0) c |0) t + /3|l) c |l) t . A subsequent erasing of 
qubit c by projection on an unbiased linear combination 
state |+) c = (|0) c + 1 1) c )/ a/2 completes the transfer of 
the quantum state in the qubit t. In the case of fusion, 
however, we have the additional complication that the 
destination carrier photon also transports another qubit 
which must not be altered in the CNOT process. Since 
KLM gates are all based on two-photon interference (i.e. 
the Hong-Ou-Mandel effect, HOM [55]), the presence of 
the additional qubit makes the two photons partially dis- 
tinguishable and hence disrupts the gate workings. 

The solution we found to this problem is based on the 
trick of initially "unfolding" the qubit to be preserved 
into the superposition of two zero-initialized qubits and 
then acting on both qubits with a CNOT gate using the 
same control, as illustrated in Fig.fl]}. In this figure, each 
line represents a possible photonic (spatial and/or polar- 
ization) mode, so that a qubit is represented by two lines 
(this is a "path" or "dual-rail" encoding of the qubits) . 
We adopt the photon-number notation in order to be 
able to describe also "empty" qubits, i.e., vacuum states, 
which do occur in the fusion protocol. Given two modes 
forming a qubit, the 1 10) ket, where the 0s and Is re- 
fer here to the photon numbers, corresponds to having 
a photon in the first mode, encoding the logical of the 
qubit. The 1 01) ket will then represent the photon in the 
second path, encoding the logical 1 of the qubit. We will 
also need the 1 00) ket, representing a vacuum state, i.e. 
the "empty" qubit. 

The two photons whose state is to be merged travel in 
two pairs of modes that we will label as c and t, with ref- 
erence to the CNOT input qubits. The initial state of the 
two photons is taken to be the following (for simplicity 
we consider the case of two independent input states, but 
the results are valid also in the more general entangled 
case): 

\il>)t=a\W) t +P\01)t 

|0) C = 7 |1O) C + <5|O1) C (3) 

The qubit "unfolding" corresponds to adding two empty 
modes for photon t and rearranging the four modes so as 
to obtain the following state: 

|V)t = a|1000) t + /?|0010) t (4) 

The first two t modes are then treated as one qubit (t\) 
and the final two t modes as a second qubit (t2), both of 
them initialized to logical zero, but with the possibility 
for each of them to be actually empty. Each of these 
qubits is then subjected to a CNOT with the same c 
qubit, and finally the c qubit is erased by projection on 
the |+) state (this corresponds to applying a Hadamard 
gate and detecting a logical zero). A simple calculation 
(see Supplementary Materials) shows that this procedure 
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FIG. 2. Schematics of the photon quantum fusion ap- 
paratus. Each red line in this scheme is an optical spatial 
mode, corresponding to two possible photonic states given by 
orthogonal polarizations. The input photon quantum states 
are polarization-encoded (horizontal H and vertical V linear 
polarizations, corresponding to logical and 1 of photonic 
qubits, respectively) and travel in spatial modes c (control) 
and t (target). The ancilla photon must enter the setup in 
the H polarization state along mode a. The output photon 
"fused" quantum state lives in a Hilbert space obtained by 
combining the two spatial modes ti and ti and the two po- 
larizations. The PBSs are assumed to transmit the H polar- 
ization and reflect the V one. The intermediate and output 
modes are given the same label in the transmission through 
each optical component, except for the unfolding step, real- 
ized by PBS t , which splits mode t into modes ti and ti . HWPs 
oriented at 22.5° implement Hadamard gates in the polariza- 
tion space. The blue-colored HWP is oriented at —22.5° so 
as to be equivalent to a a x gate followed by a Hadamard one. 
PBSca copies the c photon state onto the ancilla, while PBS a t 
and PBS C i implement the two CNOT gates. Single photon de- 
tectors (D) are used to filter the useful output. The fusion is 
successful if the output c and a photons are both detected in 
the H channel and there is one and only one photon in each 
output channel (c,a,t), an event which occurs with a proba- 
bility of 1/32. By exploiting all four possible cases in the a 
and c polarization detection and a suitable feed-forward, the 
success probability can be raised to 1/8. 

brings the target photon into the desired "fused" state 

|*) t = (\i/>) ® \<t>))t = a 7 |1000) 4 + a*|0100) t 

+/3 7 |00T0) t +/3<S|0001) t . (5) 

Since the c qubit measurement has a probability of 50% of 



obtaining |+), the described method is probabilistic, with 
a success probability of 50% (not considering the CNOT 
success probability). However, the probability can be 
raised to 100% by a simple feed-forward procedure, i.e. 
by applying a suitable unitary transformation on the t 
photon if the c measurement yields the orthogonal state 

I— > = Clo> — 

One key aspect of the illustrated method is that, af- 
ter unfolding, each CNOT works with a target photon 
that carries no additional information, so that interfer- 
ence effects are not disrupted. On the other hand, the 
CNOT gates must work properly also in the case of empty 
target qubits, in which case the CNOT must return an 
unmodified quantum state (see Supplementary Materi- 
als for details). This is a non-trivial requirement, not 
authomatically guaranteed for all KLM approaches, par- 
ticularly when two CNOTs are used in series, as in the 
present case. 

It can be readily seen that the procedure we have de- 
scribed can be also iterated for merging additional qubits 
to the same photon. For example, in order to add a 
third qubit, we only need to insert four additional vac- 
uum modes of the target, organized in four pairs so that 
all components carrying a nonzero amplitude correspond 
to the logical 0s of the qubits. Next, we need to perform 
four CNOT operations always using as control qubit the 
new qubit to be merged. Finally, we erase the control 
and leave the eight modes of the target qubit represent- 
ing the three qubits. And so on. The inverse operation of 
quantum state fission can be also implemented in a very 
similar way to the fusion, as shown in Fig. [TJ;. Further 
details about this scheme arc given in the Supplementary 
Materials. 

Let us now consider a specific demonstrative imple- 
mentation of the fusion scheme using linear optics, in 
a KLM approach [3T]. CNOTs can be implemented in 
linear optics only probabilistically, although in principle 
the probability can be made as high as desired by intro- 
ducing an increasing number of ancilla photons. In our 
implementation, which uses polarization-encoded qubits, 
we use the CNOT schemes proposed by Pittman et al., 
based on half-wave plates (HWP) and polarizing beam- 
splitters (PBS) [23-- 26 !. Since the c photon must finally 
be erased, we can use a single photon ancilla for both 
gates, and the whole scheme becomes actually symmet- 
rical for the exchange of the two CNOTs. The overall fu- 
sion scheme, shown in Fig. [5] thus requires three photons 
only, the two to be merged and the ancilla. Its theoretical 
success probability is 1/32 without feed- forward and 1/8 
with feed-forward and it does not rely on post-selection. 
The full quantum analysis of this scheme is reported in 
the Supplementary Materials. 

The experimental apparatus is shown in Fig. [3j The 
measurements are based on detecting the fourfold co- 
incidences of the trigger and the output modes a, c, t. 
For testing the fusion apparatus, we performed quan- 
tum fusion of c and t photons prepared either in the 
logical-basis polarization states \H), \V) or in their su- 
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FIG. 3. Experimental setup for implementing and testing the quantum-state fusion, (a) Optical source used 
for generating the three input photons. Spontaneous parametric down-conversion (PDC) in a 1.5-mm-thick /3-barium borate 
crystal (BBO) cut for type-II phase matching, pumped by the second harmonic (SHG) of a Ti:Sa mode-locked laser, generates 
two polarization-entangled photon pairs at 795 nm (filtered to 3-nm bandwidth) in modes ki and k2 [30]. Each mode is then 
split by a PBS. One of the four photons is immediately detected by an avalanche photodiode (APD) and used as trigger. The 
other three are spatially filtered by coupling through single-mode (SM) fibers, have their polarization set by HWPs, and enter 
the fusion setup along input modes a, c, t. (b) Fusion setup, corresponding to the layout shown in Fig. [5] To ensure greater 
phase stability, the unfolding PBSt is implemented with a 4-cm-long calcite crystal, so as to obtain two close parallel paths 
and to use a single PBS ac t in the place of PBS c j and PBS a t. Delay lines were used for the time superposition of the photons 
in the PBSs. (c) Verification stage, used for analyzing the output photon state. The analysis in polarization is made with a 
HWP and a PBS. Superpositions of modes t\ and ti were then detected by recombining the two modes in a second calcite 
crystal and then analyzing again in polarization (this step was not needed for pure t\ or ti modes), (d) Legend of optical 
elements. Single-photon count rates were 20-100 kHz, double-concidence rates 200-1000 Hz, and fourfold coincidences few per 
hour. The degree of photon indistinguishability was quantified via HOM dip visibility 22 : we obtained 0.94±0.01 for photon 
pairs generated in orthogonal polarization states in modes ki and k2 (belonging to the same PDC pair) and 0.75±0.05 for 
photons having the same polarization (different PDC pairs). 



perposition basis states |+),|— )• The experimental re- 
sults are shown in the upper panels of Fig. |4j For each 
experimental run, a given state of the two input pho- 
tons was prepared and the final four-dimensional state 
of the output photon was measured, by projection on 
all four basis states belonging to the expected fusion 
basis (see caption of Fig. [4]). Ideally, we should find 
nonzero coincidences only when the detected state is 
the expected "fused" one, corresponding to a unitary 
fidelity. The measured experimental fidelity, averaged 



over all tested states, was instead T = (75.0 ± 1.3)%. 
The average measured fidelities for each tested basis are 
shown in the Supplementary Materials. All experimen- 
tal fidelities, both averaged and individual-state ones, are 
found to be well above the 40% state-estimation maxi- 
mal fidelity for a four-dimensional quantum state, show- 
ing that our fusion apparatus works much better than 
a measure-and-prepare trivial approach. Moreover, we 
could explain quantitatively the non-unitary observed fi- 
delities by considering the fact that the PDC process 
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FIG. 4. Experimental results. The outcome of our quantum fusion experiments (upper panels), compared with the pre- 
dictions of our model (lower panels) for a degree of photon-pair indistinguishability p = 0.77. Vertical bars give the fourfold 
count rates for different input /output combinations. At input, photon-pair states belonging to the following four bases have 
been used: (i) \H t , H c ), \H t , V c ), \V t ,H c ), \V t , V c ), (ii) \H t , + c >, \H t , - c ), \V t , + c >, \V t , - c >, (hi) \+ t ,H c ), \+ t ,V c ), \- t ,H c ), \- t , V c ), 
(iv) |+t.+ c ). |+t, — c ), |— t, +e), |— t, — c)- At output, the fused-state photon was analyzed in the following corresponding 
bases (with n = 0,1,2,3): (i) {|n» = {\H tl ),\V H ),\H H ) t \V H )}; (ii) {|n'» = {|+ tl >, |- tl >, |+t a >, \~t 2 }}; (iii) {>"}} = 
{\H t+ ),\Vt + ),\H t _),\Vt_)}; (iv) {|n"')} = {|+t + ), |— |+t_), |— Here, the output modes i + and i_ are defined ac- 
cording to the rule \H)t ± = (\H) tl ± \H)t 2 ) / y/2 and analogously for the other polarizations. 



does not actually generate perfectly identical multiple 
photon pairs. Therefore, there is always a finite degree 
of indistinguishability of the three photons that affects 
the final outcome. We have developed a detailed model 
of this effect (see Supplementary Materials), whose pre- 
dictions are shown in the lower panels of Fig. [4] The 
agreement between our model and the experiment is ev- 
ident. More quantitatively, we evaluated the similar- 

% S = (E^ sjDaDy Eii A; Eij D'ij) between the 
experimental input/output probability distribution 
and the predicted one D'^, obtaining S = (94.0 ± 0.9)%. 
We note that this imperfect indistinguishability of the 
photon pairs is not a fundamental limitation of the fusion 
process, and in principle we could improve the fidelity by 
adopting a narrower spatial and spectral filtering of the 
PDC output, although at the cost of reducing the coin- 
cidence rates. 

Before concluding, let us now briefly discuss the appli- 
cation prospects of the quantum-state fusion and fission 
processes we have introduced here. First, we notice that, 
although ours has been a bulk-optics demonstration, the 
fusion/fission processes are expected to find their ideal 
implementation framework in integrated quantum pho- 
tonics [37]. Next, as already mentioned, these processes 



may enable the interfacing of multi-particle protocols 
of quantum information, in which different qubits are 
encoded in different photons, with the multi-degree-of- 
freedom approaches, in which several qubits are encoded 
in the same particle. This combination in turn, may 
have a number of applications, particularly in multi-party 
quantum communication networks |28j . For example, 
transmitting a cluster of entangled qubits through a high- 
loss channel, e.g. an Earth-satellite link, can be done 
more efficiently by first fusing the qubits in fewer pho- 
tons, transmitting them, and then splitting them again 
after detection. Even if the simplest implementations of 
fusion/fission are not highly efficient, the overall trans- 
mission efficiency can still be boosted by a large factor, as 
it will scale exponentially in the number of transmitted 
photons. Another example may be the storing of mul- 
tiple incoming photonic qubits in a smaller number of 
multi-level matter registers (25]. Here a possible advan- 
tage would be in the overall rate of decoherence of the 
stored quantum information, that for entangled states 
will scale with the number of involved registers. A third 
example is that of exploiting the multi-degree-of-freedom 
approach in order to boost the number of qubits that 
can be processed simultaneously, but without limiting 
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the possibility of interacting with separate parties for the 
qubit input /output. We also envision many interesting 
applications in the study of fundamental issues in quan- 
tum physics. For example, by exploiting the possibility 



of fusing states that are entangled with external systems 
one may create and study complex many-particle clusters 
of entangled states. 
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SUPPLEMENTARY MATERIALS 



Appendix A: Quantum state fusion 



Let us label as c and t the travelling modes of the two input photons. In a photon- number notation, the two photon 
states are taken to be the following (for simplicity we consider here the case of two independent input qubits, but the 
results are valid also in the more general entangled case): 

\1>)t = a\10) t + P\01) t 

|0)c = 7|lO)c + *|Ol) c ( A1 ) 

The qubit "unfolding" step corresponds to adding two empty modes for photon t and rearranging the four modes so 
as to obtain the following state: 

\ip)t = a|1000)t + /3|0010)i (A2) 

The first two t modes are then treated as one qubit (ti) and the final two t modes as a second qubit (£2), both of 
them initialized to logical zero, but with the possibility for each of them to be actually empty. Each of these qubits 
must now be subject to a CNOT gate, using the same c qubit as control. The action of the CNOT gate in the 
photon-number notation is described by the following equations: 

C>CNOT|10) c |10) t = |10) c |10) t 
[>CNOT|01) c |10) t = |01) o |01) t 

C>CNOT|10) c |01) t = |10) c |01) t (A3) 

^CNOT|01) c |01) t = |01) c |10) t 

However, in the present implementation of the quantum fusion we need to have the CNOT act also on "empty target 
qubits" , that is vacuum states. For these we assume the following behavior: 

C>CNOT|10) c |00) t = 77|10) c |00) t 

C>CNOT|01) c |00) t = r?|01) c |00) t (A4) 

where 77 is a possible complex amplitude rescaling relative to the non-vacuum case. A unitary CNOT must have 
1 1] I = 1, but probabilistic implementations do not have this requirement. The qubit fusion scheme works if the two 
CNOTs have the same 77. In particular the CNOTs implementation proposed by Pittman et al. (refs. 26-27 of main 
article) and used in this work have r\ = 1, so for brevity we will remove 77 in the following expressions. 



Let us now consider the entire input state given in Eqs. ( Al I: 



= (7|10) c + <5|01) c ) (a|1000) t + /?|0010) t ) (A5) 
= O7|10> c |10)ti|00) fl + 07|lO)c|OO) t i|lO)t2 + a<5|01) c |10) tl |00) t2 + /3<J|01) c |00) t i|10) t2 

where we have also split the overall four-dimensional target state in the two target qubits t\ and i 2 which will be 
subject to the two CNOT gates with the same control c. The subsequent application of the two CNOT gates, one 
acting on c and t\ and the other acting on c and t%, leads to the following state: 

|^)/ = C^CNOTa^CNOTi Y&)i 

= a 7 |10) c |10) H |00) t2 + 07|lO) c |OO)ti|lO)t 2 + a<J|01) c |01) t i|00) t2 + /W|01) c |00) tl |01) t2 (A6) 
If now we project this state on |+) c , so as to erase the c qubit, and reunite the t\ and i 2 kets, we obtain 

|*)t = (|V»> ® \<t>))t = «7|1000) f + a<5|0100) t + /3 7 |0010) t + /3(5|0001) t . (A7) 

which is the same as Eq. (5) of the main article. Since the c qubit measurement has a probability of 50% of obtaining 
|+), without feed- forward the described method has a success probability of 50% not considering the CNOT success 
probability. 
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If the outcome of the c measurement is |— ) c , we obtain the following target state: 

|*) t = O7|1000) t - a«5|0100) t + ^7|0010) t - ^|0001} t . (A8) 



This state can be transformed back into Eq. (A7) by a suitable unitary transformaton. Therefore, the success 
probability of the fusion scheme can be raised to 100% (again not considering CNOTs success probabilities) by a 
feed-forward mechanism. 



Appendix B: Quantum state fusion setup: full calculation 



In this section, the quantum process taking place in the quantum fusion setup is calculated step by step, considering 
the ideal case of perfectly identical photons. In a following Section, we will consider also the effect of a partial 
distinguishability of the photons. 

The optical layout is shown in figure 5] (or Fig. 2 of the main text). We label as c and t the input modes of the 
two photons to be fused and a the ancilla photon mode. By convention, the intermediate and output modes in the 
setup are given the same label in the parallel transmission through optical components, except for the unfolding 
step which splits mode t into modes t\ and t 2 . The input qubits are polarization-encoded photons, with horizontal 
H and vertical V linear polarizations standing for logical and 1, respectively. The output "fused" photon will be 
finally encoded in two spatial modes, t\ and t 2l and the corresponding H, V polarization modes. The polarizing beam 
splitters (PBS) are assumed to transmit the H polarization and reflect the V one and not to introduce a phase shift 
between the two components (a real PBS may possibly introduce a phase shift, but this can be always compensated 
with suitable birefringent plates, as was actually done in our experiment). The ancilla photon must enter the setup 
in the H polarization state. The half-wave plates (HWP) are all oriented at 22.5° with respect to the H direction, 
so as to act as Hadamard gates [if — ¥ (H + V)/y/2, V — ¥ (H — V)/y/2], except for the (yellow-colored in the figure) 
one located after the unfolding PBS along the t 2 output mode, which is oriented at —22.5°, so as to perform the 
transformation V — ¥ (H + V)/ \2. 

Let us now calculate the behavior of our fusion apparatus. The three-photon input state is the following: 

|**> = (a H t H c + ai H t V c + a 2 V t H c + a 3 V t V c )H a (Bl) 

where in the right-hand-side we omit the ket symbols for brevity and we have assumed the c and t photons to be in 
an arbitrary two-photon state, either separable or entangled. In particular, the case of separable qubits considered in 
the main article corresponds to setting a$ = cry, ot\ = aS, a 2 = ftj, a 3 = f3S, where (a, /3) and (7, S) are the coefficient 
pairs defining the two qubit states. As we shall see below, the symbols H and V should be actually taken to represent 
the photon creation operators (for the given polarization and spatial mode) to be applied to the vacuum state, in 
order to obtain the resulting ket state. The creation-operator interpretation is needed for the states involving two or 
more photons in a same mode, which in our case appear only as intermediate states but not in the final output. 

We now consider the effect of each optical component in sequence, indicating as subscript (s) the mode(s) on which 
the optical component is acting: 

(a Q H t H c + axH t V c + a 2 V t H c + a 3 V t V c )-j=(H a + V a ) 

PBS I 

^4 -^[(ao-fft + a 2 V t )H c H a + ( ai H t + a 3 V t )V c V a + { ai H t + a 3 V t )H a V a + {a H t + a 2 V t )H c V c )} 

hwp.hwp 1 „ gt + a2 y t) ( Hc + y c) ( Ha + Va) + {aiHt + a3Vt )( Hc _ Vc )( Ha _ Va ) 
2v 2 

+{a x H t + a 3 V t ){H 2 a - V*) + {a H t + a 2 V t )(H 2 c - V c 2 )} 

where in the last expression, we introduced the squared symbols H 2 or V 2 to denote a two-photon state for the 
given mode (the precise normalization convention here corresponds to considering the H and V symbols as creation 
operators for the given mode acting on vacuum, or equivalently to setting H 2 = v2|2)jj in the photon- number ket 
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Dua 



FIG. 5. Schematics of the quantum fusion apparatus (this figure is equivalent to Fig. 2 of the main manuscript and is 
repeated here for the reader convenience). Each red line in this scheme is a photonic spatial mode, corresponding to two 
possible photonic states given by orthogonal polarizations. Polarizing beam-splitters (PBS) are indicated by barred squares 
and assumed to transmit horizontal and reflect vertical polarizations; half-wave plates (HWP) are indicated by thin rectangles 
and are all oriented at 22.5° so as to implement Hadamard gates in the polarization space, except for the yellow-colored one 
which is oriented at —22.5° so as to be equivalent to a NOT gate followed by the Hadamard one; single photon detectors (D) 
are used to filter the useful output. 



notation, and similar). Continuing with the unfolding step in the t mode: 

^%^[(a H tl + a 2 V t2 )(H c + V c )(H a + V a ) + ( ai H tl + a 3 V t2 )(H c - V c )(H a - V a ) 
+{a x H tl + a 3 V t2 )(H 2 a - V 2 ) + (a H tl + a 2 V t2 )(H 2 c - V c 2 )} 

Note that the absence of a symbol for a given mode in a term corresponds to having the vacuum state in that mode 
(this again corresponds to the creation-operator notation). Next, we have 

HWPt,HWP t , 1,, 

i 2 -+-[(a H tl + a Q V tl + a 2 H t2 + a 2 V t . 2 )(H c + V c )(H a + V a ) 

+{aiH tl + a 1 V H + a 3 H t2 + a 3 V t . 2 )(H c - V c ){H a - V a ) 

+{a x H tl + a x V H + a 3 H t2 + a 3 V t , 2 ){H 2 a - V 2 ) + (a H tl + a Q V tl + a 2 H t2 + a 2 V t2 )(H 2 - V 2 )] 

Let us now consider the effect of each of the two PBS acting as CNOTs. We will also drop all terms which do not lead 
to one (and only one) output photon in each of the two modes c and a. The latter is a projection step (PS) ensured 
by the final photon detection. We thus obtain 

PBS PS 1 

" 1 ") -[{a H tl H a + a V tl V a + a 2 H t2 H a + a 2 V t2 H a )(H c + V c ) 

+( ai H tl H a - ai V tl V a + a 3 H t2 H a + a 3 V t2 H a )(H c - V c ) + a V a (H 2 - V 2 )] 

PBS C t,PS c 1 , 

>-(a H tl H a H c + a V tl V a H c + a 2 H t . 2 H a H c + a 2 V t2 H a V c + ai H tl H a H c - a x V tl V a H c 



+a 3 H t2 H a H c - a 3 V t2 H a V c 
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and finally all the exit half-wave plates: 

HWP HWP HWPt HWP* 1 

1 i> [(a +ai)(fr tl + V tl )(H a + V a )(H c + V c ) + (a -ai)(iT tl - V tl )(H a - V a )(H c + V c ) 

+ (a 2 + a 3 )(tf t2 + V t2 )(H a + V a )(H c + V c ) + {a 2 -a 3 )(H u - V t2 )(H a + V a )(H c - V c ) 
By expanding the products, we obtain the following final state: 

H H H V 

I*/) = -^-^(a H tl + a x V tl + a 2 H t2 + a 3 V t2 ) + -^(a H tl + a{V tl + a 2 V t2 + a 3 H t2 ) 

+ ~[^( aoVt i + a ^ H ti + a 2 H t 2 + a 3V t2 ) + ~i^( aoVt i + a l H ti + a 2 V t2 + a 3 H t 2 ) 

It is clear from this expression that if the detectors of the a mode and c mode both detect H photons, then in the t 
modes we obtain directly the merged photon (this occurs with a probability of 1/32): 

|*/) = a H tl + ai V tl + a 2 H t2 + a 3 V t2 (B2) 

The other cases can be transformed into this by a simple feed-forward mechanism: a V detection on the a channel 
will indicate that we must apply a a x operator on the t\ mode, while a V detection on the c channel indicates that 
we must apply it on the t 2 mode. 

By exploiting all cases, we have a total success probability for the fusion of 1/8. In principle, the scheme may 
become quasi-deterministic by adopting a KLM-like approach with teleportation and a very large number of ancilla 
photons. 

It is remarkable that we could apply the one-ancilla CNOT in the first step and the outcome was not ruined by 
the presence of intermediate double-photon modes. This occurs because the double-photon modes running in the a 
or c channels always have two photons with orthogonal polarizations. The half-wave plates will then transform them 
into HH or VV double photons, which are either both reflected or both transmitted at the following PBS, so that 
the final measurement-filtering stage will not include these cases. In other words, the polarization Hong-Ou-Mandel 
effect will make sure that these "wrong" channels do not contribute to the final projected state. 

Another remarkable fact is that in principle the projection step does not require a destructive post-selection if the 
detectors are able to distinguish between one- and two-photon events, as in this case the detection of the t photon is 
not necessary to determine the success of the process. 



Appendix C: Quantum state fission 

The general scheme of the quantum state fission process is given in Fig. lc of the main article. Adopting the photon 
number notation, we assume to have an input photon encoding two qubits in the four-path state 

\tpi) = a |1000) + ai|0100) + a 2 |0010) + a 3 |0001) (CI) 

We label this input photon as c (for control). We also label the first two modes as c\ and the last two modes as c 2 . 
We also have another photon, labeled as t (for target), that is initialized in the logical zero state of two other modes, 
so that the initial two-photon state is the following: 

= (« |10) C1 |00) C2 +ai|01) Cl |00) C2 +a 2 |00) Cl |10) C2 + a 3 |00) Cl |01) C2 )|10) t 

We now apply the two CNOT gates in sequence, using the t photon as target qubit in both cases and the c\ and c 2 
modes of the c photon as control qubit in the first and second CNOT, respectively. In order to do these operations 
properly, we need to define the CNOT operation also for the case when the control qubit is empty. As for the previous 
case of empty target qubit, the CNOT outcome in this case is taken to be simply identical to the input except for a 
possible amplitude rescaling, i.e. 

tWr|00) c |10) t = r?|00) c |10) t 

C>CNOT|00) c |01) t -r7|00) c |01) t (C2) 

which is what occurs indeed in most CNOT implementations. Our fission scheme works well if the two CNOTs have the 
same r\ factor. For brevity we simply assume r\ = 1 in the following, which is the case of the CNOT implementations 



12 



we will utilize. Hence, we obtain 

E>gno Ti C>gnot 2 \%) = oo|10) Cl |00) C2 |10) t + a x |01) Cl |00) C2 |01) t + a 2 |00) Cl |10) C2 |10) t + a 3 |00) Cl |01) C2 |01) t 

Now we need to erase part of the information contained in the control photon. This is accomplished by projecting 
onto |+) combinations of the first and second pairs of modes, while keeping unaffected their relative amplitudes. In 
other words, as shown Fig. lc of the main article, we must apply an Hadamard transformation on both pairs of modes, 
and take as successful outcome only the logical-zero output (corresponding to the | +) combination of the inputs) . The 
projection is actually performed by checking that no photon comes out of the |— ) (i.e., logical one) output ports of the 
Hadamard. The two surviving output modes arc then combined into a single output c-photon qubit, which together 
with the i-photon qubit form the desired split-qubit output. Indeed, we obtain the following projected output: 

|*/) = a |10) c |10) t + a 1 |10) c |01) t + a 2 |01) c |10) t + a 3 |01) c |01) t (C3) 

which describes the same two-qubit state as the input, but encoded in two photons instead of one. 

The proposed scheme for fission has a 50% probability of success, not considering the CNOT contribution. It might 
be again possible to bring the probability to 100% (not considering CNOTs) by detecting the actual c-photon output 
mode pair after the Hadamard gates by a quantum non-demolition approach or in post-selection, and then applying 
an appropriate unitary transformation to the t photon. Alternatively, one can in principle use the KLM approach 
with many ancilla photons and teleportation to turn the scheme into a quasi-deterministic one. 




FIG. 6. Schematics of the quantum fission apparatus. As in Fig. SI, each red line in this scheme is a photonic spatial mode, 
corresponding to two possible photonic states given by orthogonal polarizations. The input photon enters in the modes C1-C2. 
Splitted ouput photons exit along mode t and either c or c'. Barred squares indicate PBSs; thin rectangles indicate HWPs, all 
oriented at 22.5° except for the green-colored one which are oriented at 45° so as to be equivalent to a NOT gate swapping H 
and V . 

A possible specific apparatus for quantum state fission is shown in Fig. S2. In this setup, output photon internal 
quantum states are defined in the polarization space, as for the fusion setup case. The four modes of input photon c 
are defined by the two spatial modes c\ and c 2 and the two polarizations H and V. Both the target photon t and the 
ancilla photon a must enter the setup after being initialized to logical zero, i.e. iJ-polarized. 

The input three-photon state is the following: 

|*,) = (a H cl + a{V Cl + a 2 H C2 + a 3 V C2 )H t H a , (C4) 

The calculations of the optical component effect on this state can be carried out similarly to the case of the fusion 
setup and we will not repeat them here. We give only the final output obtained after all components and after 
projection on a subspace in which there is only one photon per output mode (i.e., one in a, one in t, and one in either 
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c or c 



|*/> - ^(a H t H c + a x V t H c + a 2 H t V c + a 3 V t V c ) + -^=(a H t H c - ai V t H c + a 2 H t V c - a 3 V t V c ) 

\{a V t H c , + ai H t H & + a 2 V t V d + a 3 H t V d ) + -^=(-a V t H c > + ai H t Hc - a 2 V t V c , + a 3 H t V rJ 



4^2 4\/2 

This expression shows that again the success probability is of 1/32 without feed-forward. If we detect the ancilla 
photon in the H polarization and a photon coming out on the c channel, then the final state is the desired one: 

|*/) = a H t H c + cctVtHc + a 2 H t V c + a 3 V t V c (C5) 

Also in this case a feed-forward mechanism can increase the success probability. In particular, if the output ancilla 
is V^-polarized, then we must change the sign of the V t mode. If the control photon comes out from the d channel, 
instead of the c channel, then we must swap Vt and H t . The ancilla detection can be done without disturbing the 
c and t output photons, so with a real feed-forward mechanism we can double the success probability. The control 
photon detection can only be done destructively (not considering quantum non-demolition possibilities) and hence a 
true feed-forward is not practical (in principle we could post-pone the measurement of the target photon and correct 
its polarization state, but in practice this is not useful). Once again, however, a KLM-like approach based on a large 
number of ancilla photons can in principle turn this process into a quasi-deterministic one. 



Appendix D: Modeling photon distinguishability 

The three-photon state used as input in the quantum fusion experiment was conditionally prepared by exploiting 
the second-order process of a type-II spontaneous PDC source, generating two photon pairs, with one of the four 
photons used only as trigger. A crucial parameter in the four-photon component of the generated state is given by 
the partial distinguishability of the two generated photon-pairs. In this section we develop a theoretical model for 
the experiment which takes into account an imperfect indistinguishability between the three photons. To model the 
effect of spectral and temporal distinguishability, the generated four-photon density matrix in the two output spatial 
modes ki and k 2 can be written as follows [31]: 

g= P mm + (i-v)m {A) m ® iv_> (1 %-i, m 

where: 

\if>i) = ^=(\HH) 1 \VV) 2 - \HV) 1 \HV) 2 + \VV)x\HH) 2 ), (D2) 

m^ = ^(\H)f\V)f-\V)f\H)f), (D3) 

Here, the parameter p is the fraction of events in which two indistinguishable photon pairs are generated in the \ip-} 
state. Conversely, (1 — p) is the fraction of events in which two distinguishable photon pairs are emitted by the source 
(labeled by the superscript i — A,B). Hence, the parameter p represents the degree of indistinguishability of the 
four-photon state. Referring to the experimental scheme of Fig. 3 in the main text, the output state is analyzed in 
polarization by selecting the contributions in which two orthogonally-polarized photons are generated on each spatial 
mode. The corresponding weights on the states l^i) and l^ 2 .)^ <8> lip-)^ are respectively 1/3 and 1/2, leading to 
an overall fraction of events with three indistinguishable photons given by r = 2p/ (3 — p). 

Let us now proceed with the evolution induced by the quantum fusion apparatus. Without loss of generality, we 
consider the case in which the photons in the ancillary output modes are projected in the \H) polarization state. We 
first observe that due to the polarization analysis of the source, the control and the target qubit always belong to the 
same photon pair since they are selected with orthogonal polarizations in the two different spatial modes. When two 
indistinguishable photon-pairs are produced by the source, the input state in the apparatus takes the form 

I^W) = H a (aH t + /3V t )(jH c + 5V C ), (D4) 

where we have resumed here the shortened notation in which the ket symbols on the left-hand-side are omitted. In 
this case, with probability P in( j = 1/32, the output photon will emerge only in the desired output state, corresponding 
to the qubit fusion. When the two pairs emitted by the source are distinguishable, instead, the ancillary photon 
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belongs to a different photon pair with respect to the control and target photons. The input state in the qubit fusion 
apparatus can then be written as follows: 



Hi A \aH^+pV^){ 1 HW+5V^) 



(D5) 



where the superscripts (A) and (B) distinguish the photons and hence remove the interference effects between the 
ancilla and the other two photons, thus leading to spurious contributions in the output state and decreasing the 
fidelity. The overall input state is then obtained as: 



Qp = »'|?Aind)(V , md| + (1 - r)\i>d)(ipd\ 



(D6) 



The output state can be evaluated by applying to this input the same sequence of operations of the fusion apparatus 
reported above and finally keeping only the terms corresponding to the detection of two horizontally-polarized photons 
in the ancilla and control output modes. 

We skip the intermediate calculations and in the following just give the results we have obtained for the four state 
bases which were tested in our quantum fusion experiment. 

Basis (i), using {\H) t , \V) t ; \H) C , \V) C }. The input/output probability matrix for this basis reads: 



p(i) 
P 



3+p 3— p 3— p 3—p \ 
2-8p 12-8p 12-8p 12-8p \ 

10 



V 





3-p 
12-8p 





3-P 
12-8p 



1 

3—p 3+p 
12-8p 12-8p/ 



(D7) 



The matrix rows correspond to the following input two-photon states: {\H t H c ), \H t V c ), \V t H c ), \V t V c )}. The matrix 
columns correspond to the following single-photon output states: {|0) = \H) tl , |1) = \V) tl , |2) = \H) t2 , |3) = |V)t 2 }. 
Of course, the matrix is normalized so that the sum for each row equals unity. 

Basis (ii), using {\H) t , \V) t ; |+) c , |— ) c }- The input/output probability matrix reads: 



p(ii) 

v 



/ 3+p 


3-P 





3-P 


/ 9-5p 


9-5p 


9-5p 


3-p 


3+p 





3-P 


9-5p 


9-5p 


9-5p 





3-P 


3+p 


3-P 


9-5p 


9-5p 


9-5p 


V 


3-P 


3-P 


3+p 


9-5p 


9-5p 


9-5p 
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The rows correspond to the following input two-photon states: {\H t + c ), \H t — c ), \V t + c ), \Vt— c )}- The columns corre- 
spond to the following single-photon output states: {|0') = 11') = |— |2') = |+)t 2 , |3') = |— )t 2 }- 
Basis (iii), using {\H) C , \V) C ; \+)t, \—)t}- The input/output probability matrix reads: 



p(iii) 
P 



4(9-5p) 
3(1-P) 

4(9-5p) 
3(1-P) 

4(9-5p) 
3(1-P) 



3(1-P) 
4(9-5p) 

15+p 
4(9-5p) 
3(1-P) 
4(9-5p) 
3(1-P) 



\4(9-5p) 4(9-5p) 



9(1-P) 
4(9-5p) 

9(l-p) 
4(9-5p) 

21-5p 
4(9-5p) 

9(1-P) 
4(9-5p) 



9(1-P) \ 
4(9-5p) 1 

9(l-p) 
4(9-5p) 

9(l-p) 
4(9-5p) 

21-5p 
4(9-5p)/ 
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The rows correspond to the following input two-photon states: {| + t H c ), | + t V c ), \ — t H c ), \ — t V c }}. The columns 
correspond to the following single-photon output states: {|0") = \H) t+ ,\l") = \V) t+ ,\2") = |^) t _,|3") = \V) t _}, 
where the output modes t + and t_ are defined according to the rule \H) t± = (\H) tl ± \H) t2 )/\/2 and analogously for 
the other polarizations. 

Basis (iv), using {|+) c , |— ) c ; \+)t, \—)t}- The input/output probability matrix reads: 



p(iv) 

p 



/ 3+p 
' 6-2p 



V 



o 

3+p 
12-8p 
3(1-P) 
12-8p 



3(1-P) 
6-2p 
9(1-P) 
12-8p 
2(3-p) 
12-8p 
3(1-P) 
6-2p 



\ 


3(1-P) 
12-8p 

3+P 

6-2p / 
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The rows correspond to the following input two-photon states: {| + t + c ), \ H 
correspond to the following single-photon output states: {|0"') = |+)t + , 
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FIG. 7. Input/output probabilities for the different bases tested in our experiment, for a degree of indistinguishability p = 0.77 
of the photon pairs: (a) basis (i); (b) basis (ii); (c) basis (iii); and (d) basis (iv). 
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FIG. 8. Average quantum-state fusion fidelity in the four bases tested in our experiment, as a function of the indistinguishability 
parameter p. 



The value of the indistinguishability parameter p applicable to our photon source has been obtained by optimizing 
the agreement between the experimental results and the model predictions in the case of basis (ii), which shows 
a marked non-uniformity of the fidelity. We obtained p — 0.77, in good agreement with the results of the direct 
Hong-Ou-Mandel visibility measurement. 

In Fig. [7] we report the theoretical input/output probabilities for the states of the bases (i)-(iv) and for a degree of 
indistibuishability p — 0.77 of the photon pairs. From these results we can also evaluate the average fidelity of the 
qubit fusion process for the entire chosen set of 16 input states, which reads 

9-5p 
and is shown in Fig. [8] 

Finally, in Fig. [9] we report the average predicted fidelities for each tested basis compared with the results of our 
experiment. 
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FIG. 9. Average measured fidelities (red bars) for the four photon-pair bases listed in Fig. [4] compared with the predicted 
fidelity according to our model (green bars). 



